首页> 外文OA文献 >Automated languages phylogeny from Levenshtein distance
【2h】

Automated languages phylogeny from Levenshtein distance

机译:Levenshtein距离的自动语言系统发育

摘要

Languages evolve over time in a process in which reproduction, mutation andextinction are all possible, similar to what happens to living organisms. Usingthis similarity it is possible, in principle, to build family trees which showthe degree of relatedness between languages. The method used by modern glottochronology, developed by Swadesh in the1950s, measures distances from the percentage of words with a common historicalorigin. The weak point of this method is that subjective judgment plays arelevant role. Recently we proposed an automated method that avoids the subjectivity, whoseresults can be replicated by studies that use the same database and thatdoesn't require a specific linguistic knowledge. Moreover, the method allows aquick comparison of a large number of languages. We applied our method to the Indo-European and Austronesian families,considering in both cases, fifty different languages. The resulting trees aresimilar to those of previous studies, but with some important differences inthe position of few languages and subgroups. We believe that these differencescarry new information on the structure of the tree and on the phylogeneticrelationships within families.
机译:语言随着时间的流逝而发展,在这个过程中,繁殖,突变和灭绝都是可能的,这与活生物体类似。使用这种相似性,原则上可以构建显示语言之间相关程度的族谱。斯瓦德什(Swadesh)在1950年代开发的现代地理年代学方法测量距离与具有共同历史起源的单词所占的百分比。这种方法的弱点是主观判断起着重要的作用。最近,我们提出了一种避免主观性的自动方法,该方法的结果可以通过使用相同数据库且不需要特定语言知识的研究来复制。此外,该方法允许快速比较多种语言。我们将方法应用于印欧语系和南洋语系,在这两种情况下都考虑了五十种不同的语言。生成的树与以前的研究类似,但是在少数语言和子组的位置上有一些重要差异。我们认为,这些差异为树的结构和家庭内部的系统发育关系带来了新的信息。

著录项

  • 作者

    Serva, Maurizio;

  • 作者单位
  • 年度 2012
  • 总页数
  • 原文格式 PDF
  • 正文语种 {"code":"en","name":"English","id":9}
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号